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Abstract. The Backhand Transform, first developed in the context of differential geometry, has 
been classically used to obtain multi-soliton states in completely integrable infinite dimensional 
dynamical systems, ft has recently been used to study the stability of these special solutions. We 
offer here a dynamical perspective on the Backlund Transform, prove an abstract orbital stability 
theorem, and demonstrate its utility by applying it to the sine-Gordon equation and the Toda 
lattice. 

1. Introduction 

In this paper we survey some recent work on the use of Backlund transformations to study the 
stability of localized structures in infinite dimensional Hamiltonian systems. For finite dimensional 
Hamiltonian systems the constraints imposed by the Hamiltonian structure mean that the stability 
of stationary solutions can be reduced either to showing that all the eigenvalues of the linearized 
system at the fixed point lie on the imaginary axis (for spectral stability) or that the full, nonlinear 
system exhibits Lyapunov stability. The stability of periodic orbits can be studied by similar 
methods by reducing the problem to the consideration of a fixed point of a Poincare map. 

For infinite dimensional systems the situation can be more subtle. There, the possible presence of 
dispersive phenomena means that one may have asymptotic stability of such systems, in appropriate 
norms, a phenomenon that is impossible in finite dimensional systems. 

Consequently, the study of stability in such systems has followed two rather different tracks. 
On one hand, methods to prove Lyapunov (or orbital) stability of localized solutions like traveling 
waves, solitons, or multi-solitons have been developed which rely on regarding the solution as a 
minimizer, or critical point, of some energy functions, often subject to appropriate constraints. 
Examples of this type of approach are [2], [3], [4], [5]. 

The second approach typically begins by analyzing the linearization of the system about the 
solitary wave. The spectrum of the linearization is then considered and one shows that on the 
complement of the point spectrum the linearized evolution generates a dispersive evolution. If 
the dispersive decay is sufficiently rapid, this can then (sometimes) be used in conjunction with 
DuhamePs formula to derive nonlinear, asymptotic stability of the underlying solitary wave. Ex- 
amples of this approach include [6], [7]. Closely related to this approach are stability or instability 
results based on invariant manifold theorems [8], [9]. Here, one typically shows that the nonlin- 
ear equations possesses an invariant manifold associated with the family of localized solutions and 
examines the behavior of solutions near this manifold to understand stability properties of the 
underlying family. 

Recently, an old tool has been adapted to study both of these types of stability, namely Backlund 
transformations. Backlund transformations define a relationship between two functions (often, 
through a differential equation, or some more complicated equation) such that if one of the functions 
satisfies a given partial differential equation, so does the second. The partial differential equation 
satisfied by the second function may be the same PDE satisfied by the first function (in which case 
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one speaks of an auto-Backlund transform ) or it may be a different PDE. In the study of infinite- 
dimensional Hamiltonian systems, Backlund transforms have mostly been used in the context of 
completely integrable infinite dimensional systems to obtain explicit formulas for soliton, multi- 
soliton, or other special solutions of the equations. Thus, for instance, the auto-Backlund transform 
for the Korteweg-de Vries (KdV) equation relates the zero solution to the one-soliton solution, the 
one-soliton solution to the two-soliton solution and so on and so forth. 

However, the Backlund transformation is also turning out to be a useful tool to investigate the 
stability of such special solutions as well. It may not be clear at first glance why this is of interest. 
Since in principle, one knows "everything" about solutions of a completely integrable system, the 
stability or instability of such solutions might seem an obvious by-product of their integrability. 
In practice, however, it may be difficult to see from the formulas defining the solutions in these 
integrable systems what the asymptotic behavior of solutions with initial conditions close to a 
soliton are. Furthermore, stability results based on Backlund transformations have yielded at least 
two new insights not available from the complete integrability machinery: 

• First, in some circumstances, they allow one to establish stability in much less regular spaces 
than can be treated either with completely integrable structure, or with energy methods. 
A first example of this approach is the work of Merle and Vega, [10]. They used the 
Gardner transformation which maps solutions of the KdV equation close to the soliton into 
solutions of the modified KdV equation near a kink solution. (The Gardner transformation 
is an example of a Backlund transformation which links two different equations.) They 
then use the stability of modified KdV kinks in the energy space, plus the fact that the 
Gardner transformation also maps L 2 solutions in KdV into solutions of modified KdV 
to conclude that KdV solitons are actually stable in L 2 . This approach has since been 
extended to conclude that multi-soliton solutions of KdV are also stable in L 2 , [11], and 
also that the soliton solution of the nonlinear Schrodinger equation is stable in L 2 , [12]. 

• A second advantage of Backlund transformation methods is that they can sometimes be 
used as the starting point for a perturbative argument which yields insight into the behavior 
of other non-integrable systems. Thus, the Backlund transformation-based study of the 
stability of soliton solutions of the (integrable) Toda-lattice in [13] served as the basis for a 
simple proof of stability of solitary waves in a general class of non-integrable Fermi-Pasta- 
Ulam models [14]. 

While this paper will focus on rigorous applications of the Backlund transformation method it is 
worth noting that similar ideas have been used in non-rigorous settings (sometimes in advance of 
the rigorous applications) to compute explicit approximate solutions with initial conditions close 
to solitary waves. Thus, in [15], Mann used a linearized Backlund transformation to compute the 
Green's function for the KdV equation linearized about the soliton solution and then in turn used 
this to study the evolution of initial conditions close to the soliton. Likewise, Tsigaridas, et al [16] 
make a more general study of this same question and apply these ideas to compute approximate 
solutions of both the nonlinear Schrodinger equation and KdV equations with the aid of linearized 
Backlund transformations. 

The classical view of the Backlund transform for the sine-Gordon equation is geometric in nature. 
It relates angles between curves of zero curvature on patches of pseudo-spherical surfaces. As is 
common in differential geometry, partial differential equations arise. Here the partial differential 
equation relates the aforementioned angle as a state variable to the coordinates on the manifold as 
independent variables. Thus the geometric relationship between these angles on a pair of psuedo- 
spherical surfaces manifests in the PDE world as a relationship between a pair of solutions. 
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The zero solution is related in this way to a family of monotone front solutions u(t, x) = u c (x — 
ct — 5) which connect and 2ir. In one physical model, the state variable u in the sine-Gordon 
equation corresponds to the angle by which an elastic ribbon is twisted from vertical at position x 
and time t. The front solution obtained from the zero solution via Backlund transform (a maneuver 
which naively appears to have everything to do with the geometry of pseudo-spherical surfaces and 
nothing to do with the twisting of elastic ribbons) thus corresponds to an elastic ribbon that has a 
full twist, or kink, and is commonly called a kink solution. Applying the Backlund transformation 
to the kink solution can now produce a solution which is asymptotic to and 4tt at spatial ±oo, 
i.e. has two kinks. This so-called two-kink solution resolves as t — > ±oo to the linear combination 
of two well-separated kink solutions, each traveling with its own characteristic speed. Moreover, 
the characteristic speeds of the kinks are identical at temporal ±oo but the phases are allowed to 
vary. Thus we can regard the two-kink solution as capturing an interaction in which a fast steep 
kink overtakes a slow shallow kink with only a phase shift (as opposed to excitation of dispersive 
modes) to show for the nonlinearity. Repeated application of the Backlund transform can produce 
multi-kink solutions which resolve into linear combinations of multiple kink solutions much as 
multi-soliton solutions resolve into linear combinations of solitons. 

It is well-known to experts that the perspective of the Backlund transformation is a very useful for 
constructing multi-soliton solutions. In studying the stability of these kink and multi-kink solutions, 
however, the theory of dynamical systems is necessarily brought in and from this perspective the 
classical view of the Backlund transform is not entirely natural. The reason for this is as follows. The 
typical strategy of proof in the nascent literature of stability via Backlund transform is effectively 
to conjugate the flow about a soliton or multi-soliton (or the linearization thereabout) with the 
flow about the zero solution, leveraging the stability of the zero solution to obtain the stability of 
the soliton or multi-soliton. The problem from the perspective of dynamical systems is that when 
conjugating a flow one makes use of a map that acts on the phase space and not on the much 
larger space of trajectories in the phase space. One of the key ideas in this paper is to redefine the 
Backlund transform as a map that acts on the phase space. An orbital stability result for solitons 
and multi-solitons then follows very quickly from this definition with the aid of well-developed and 
classical ideas in dynamical systems. 



2. Abstract orbital stability 

Definition Let X and Y be open subsets of affine subspaces of Banach spaces and let $(t) : X — > X 
and ^f(t) : Y — > Y be semiflows. Let A be a finite dimensional manifold and let Z be a Banach 
space. Let F : X x Y x A ^ Z be a C 2 function such that for each AG A, M\ := F(-, •, A) _1 (0) is 
an invariant set for the product flow & x ^ : X xY — > X xY. Assume further 

(HO) there is some (x, y, A) G X x Y x A such that F(x, y, A) = 0. 

(HI) D x F(x,y, A) : T X X — > Tpi x y \\Z is Fredholm and injective whenever F(x,y,X) = 0. 
(H2) Dy t \F(x,y, A) : T y Y — > Tpr xy \\Z is an isomorphism whenever F(x,y, A) = 0. 

Then we say that F Backlund-conjugates the flows $ and ^. In the case that <I> = ^ we say 
that F auto-Backlund-conjugates <!> with itself. 

Theorem 2.1. Assume (H0)-(H2). Let (x, y, A) be given as in (HO) and let H C Y be an invariant 
manifold for ^ that contains y and is stable in sense of Lyapunov: There is an eq > such that for 
each e 6 (0, £o] there is a 5 = 5(e) > such that for any t > 0, we have dy(^(t)y, H) < e whenever 
dy(y,H) < 5. Assume further that 
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CI Given any compact subset Ao C A there is a constant C such that for each (x, y, A) G F 1 (0) ; 
there is a subspace X\ complementary to the kernel of D x F(x, y, A) such that 

'D x F{x,y,\)\ x y l \<C 

with the estimate uniform among y G Y with dy(y,H) < S(eo) := 5q, x £ X and A G A 
with F(x,y,X) = 0. Furthermore the estimate 

\\D x F(x , y , A ) - D x F(x 1 ,yi, Ai)|| < C(d x (x , x{) + d Y (yo, Vi) + d A (Xi, A )) 

holds uniformly among xq, x\, yo, yi, Ao, Ai with dy(yj,H) < 5q and (xj,yj,Xj) G i ? ~ 1 (0). 
C2 the norm of (D y \F(x, y, A)) -1 is bounded above uniformly among (x, y, A) such that dy(y, H) <| 

S(so) and A C Ao compact and F(x,y,X) = 0. Furthermore, D y; \F(x,y, A) is uniformly 

Lipschitz among (x,y,X) G F _1 (0) suc/i t/iat dy{y,H) < So 
C3 i/te norm of D x F(x,y, X) is bounded above uniformly among (x,y,X) with dy(y,H) < 5(eo), 

with A C Ao compact and F(x, y, A) = 0. 

Then there is an invariant manifold M for $ containing x, a function A* : M — > A, a decompo- 
sition of M into invariant manifolds M x = (A*) _1 (A), as well as a constant C such that 

d x (<S>(t)x,M x *W) < Ce 

whenever dx(x, M) < ^S. Moreover, M x is precisely the set of x G X such that F(x, y, A) = for 
some y £ H . 

Proof. Let (x,y,X) be given as in (HO). It follows from (H2) and the implicit function theorem 
that there are smooth functions y* and A* mapping a neighborhood of x to neighborhoods of y and 
A respectively such that F(x,y* (x), X*(x)) = 0. Furthermore, these functions are unique in that if 
(x, y, A) is close to (x, y, A) and F(x,y,X) = 0, then y = y* (x) and A = A* (x) . 

We claim that in addition, the functions y* and A* can be extended to some maximal domain 
such that on this domain the range of y* contains a 5-neighborhood of H. To establish this 
claim, first note that (H2) allows us to enlarge the domain on which y* and A* is defined by 
applying the implicit function theorem with base point (x,y*(x), X*(x)) for any x in the domain 
of y* and A*. Furthermore, one can observe from the proof of the implicit function theorem 
that diameter of the neighborhoods on which the implicit functions y* and A* are defined can be 
taken to be 2\\D x F(x,y, X)~ 1 \\LipD x F. In light of condition (CI), this diameter is uniform among 
(x,y, A) G F _1 (0) for which dy(y,H) < do- This establishes the claim: by repeatedly applying the 
implicit function theorem we can enlarge the domain of y* and A* sufficiently so that the range of 
y covers a neighborhood of H. 

Note that 

L)SU ) =-D y ,xF(x,y*(x),X*(x))- 1 F x (x,y*(x),X*(x)), 

and hence the implicitly defined functions y* and A* are Lipschitz on any set for which x *— > 
\\Dy j \F(x,y*(x) 1 X*(x))~ 1 \\ and x >->■ \\F x (x,y*(x),X*(x))\\ enjoy a uniform bound. Define M = 
(y*)^ 1 (H) and with the notation H$ = {y | dy(y,H) < 6} define for S sufficiently small Ms = 
(y*)^ 1 (Hs). It follows from conditions (C2) and (C3) that y* and A* are Lipschitz on Ms C X for 
S<5(e ). 

Let T X X = K x X\ be a Lyapunov-Schmidt decomposition of T X X subordinate to D x F(x, y, A). 
Here K denotes the kernel and X\ is chosen as in (CI). It follows from (HI) that there is a smooth 
implicitly defined function x\ taking a neighborhood of (y, A, 0) G Y x A x K to X\ such that 
F(x\{y, A, k) + k,y,X) = for any k in the given neighborhood of {0} C K. Since the range of y* 
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contains H$, it follows that y = y*(xl(y,X,k) + k) and more specifically that for y G H we have 
x\{y, X, k) + k £ M whenever this quantity is defined. 

Because of (HI) the point (x,y, A) is not distinguished among points in i ?_1 (0). Thus given 
any (x,y,X) G F -1 (0) there is a similar Lyapunov-Schmidt decomposition T X X = K x X\ and a 
similar implicitly defined function x*. It follows from (CI) that x\ has a Lipschitz constant which 
is uniform in the choice of base point (x, y, X) for the implicit function theorem. 

Let C denote the Lipschitz constant of y*. Since we have assumed in the statement of the theorem 
that x is (5/C-close to M it follows that y*{x) is <5-close to H. Recall that this is the neighborhood 
of Lyapunov stability for H corresponding to the given small number e: {t)y* (x) , H) < e. 

Since the pair (x,y*(x)) lies on the invariant manifold M\*( x ) = F(-, -, A*(x))" 1 (0) it follows that 

F(*(t)x,*(t)y'(x),\'(x))=0, 

hence that <&(t)x = xl(^(t)y*(x), X*(x), k) + k for one of the local functions x\. 
We now establish the Lyapunov stability of M: 

d x ($(t)x,M) < d x (<!>(t)x,x$(y*(x),X*(x),k) + k) 

= d x (xt(*(t)y*(x),\*(x), k) + k, xt(y*(x),X*(x), k) + k) 

< Lipx* 1 d Y (^{t)y*(x),y*{x)) 

< eLipxJ 

In the first line we have used our characterization of M. In the second line we have used that 
F _1 (0) is invariant for the product semiflow. In the third line we have used that x^ is Lipschitz 
and in the fourth line that H is Lyapunov-stable for the semiflow ^. □ 

3. Examples of orbital stability 

3.1. Sine-Gordon equation. As a first application of Theorem 2.1, we consider the orbital sta- 
bility of the kink solutions of the Sine-Gordon equation. While the stability of these solutions is 
not surprising and could probably be proved using the energy methods discussed above, it gives a 
simple illustration of our approach. Furthermore, as we indicate at the end of this section, we sus- 
pect that with some additional work this method will also yield the stability of multi-kink solutions 
for this equation. 

The classical Backlund transform for the Sine-Gordon equation relates two solutions u and u' by 
the pair of equations 

u x — ut = u' x — u' t + 2a sin( — - — ) (3.1) 

_/ _/ 2 u — u\ 
u x + u t = —u x — u t H — sm( 

:e 

transform can be written as 



a v 2 ' 

If we introduce phase space variables u = u, v = u% and u' = v! , v' = u' t , we see that the Backlund 



rv / / ^ ( u x + v'-asm(^- - ±sin ^ \ n ,„ nX 

F(u,v,u ,v ,a) = I , i . )u- u '{ a . ; u + u '( =0 3 - 2 

V v + u' x - ism(Y-) +asm(^±2L) J 

and it is to this function that we apply Theorem 2.1 

Recall that the Backlund transform for the Sine-Gordon equation maps the zero solution to the 
1-kink solution and then successively maps the fe-kink to the k + 1-kink, for any positive integer k 
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[17]. With this in mind, let X = {(u,v) | sin(|) <E H 1 and v G L 2 and u(— oo) = 0}. Consider the 
decomposition X = U^L^Xfc where X fe = {(u,v) G X | u(oo) = 27rfc}. 3 Let <&(i) = \I/(i) denote 
the time t map for the sine-Gordon equation uu = u xx — s'mu and let Z = L 2 x L 2 . Given the 
properties of the /c-kink solution, it is natural to study its evolution in the space X^. For the time 
being, since we want to concentrate on the 1-kink solution we will focus on the spaces Xq and X\, 
and consider the function: 

F:XixI x(0,l)^2 (3.3) 

We now have: 

Theorem 3.1. Let u be a 1-kink solution for the sine-Gordon equation, let e > be given and let 
u° be e-close to u(t) in H 1 x 1? for time t = 0. (i.e. (uq — u(0)) is small in H 1 x L 2 ). Let u l 
denote the time- evolution of u . Then for all time, u l remains ^/e -close to some 1-kink solution 
with to the speed of u 

Proof. We first check that hypotheses (HO), (HI), and (H2) are satisfied. We take y = (u',v') = 
(0,0), and then we can solve explicitly for those points x = (u,v) for which F is zero and we find 
u = 4arctan(exp(ax + 5)), v = 4av exp(ax + 8)/(l + exp(2(ax + 5)), where v and a are related by 
ay/l — v 2 = 1. (Note that this calculation insures that (HO) is satisfied.) However, we will use 
this explicit form of the kink solution very little, in order to set the stage for our discussion of the 
stability of the /c-kink solution later in this section. 
Now differentiate F with respect to (u, v) to obtain: 

/^-§cos(^)-^cos(^) \ 
D(u,v)F(u,v,v! ,v' , ,a) = (3.4) 

V a x -icos(2±^) + §cos(J^) 1/ 

To invert this operator we need to solve the system of ODE's: 

a x -fcos(^)-^cos(^) 0\ 

^-^cos(^) + fcos(^) 1 / 

Note that the operator is lower triangular, and the second row gives ip in terms (p x , a bounded 
(invertible) multiplication operator acting on <fi, and g. 

Thus, we focus on the first row which gives <p as the solution to a first order non-autonomous 
ODE (in x) with inhomogeneous term given by /. More precisely, we must solve 

fa ,u + v! 1 u-v! 

o x d> — -cos( H cos ( 

Y \2 v 2 1 2a v 2 1 

We analyze this equation, and similar equations below with the aid of the following lemma. 
Lemma 3.2. Consider the ODE 

u x - a(x)u = f(x) . (3.7) 

(1) Assume that a± = lim x ^± OQ a(x) are defined with ct_ > > a + . Assume further that 
J °° \a(t) — a + \dt < oo, and J ^ \a(t) — a>-\dt < oo Then there exists a constant C a , such 
that the unique solution of (3.7) with u(xq) = satisfies \\u\\ H i < CaWfUi 2 - 




(3.6) 



'Note that the fact that sin(u/2) 6 H 1 implies that the jump in u from — oo to oo is an integer multiple of 2it. 
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(2) Assume that a_ < < a + , with \a{x)\ < k < oo and that j R f(t)(f){t)dt = for the 
unique (up to contant multiple), non-zero, bounded (f> solving the adjoint ODE (j) = — a{t)(j). 
Then there is a unique choice of uq for which u G L 2 and for this choice of uq we have 
\\u\\m<C a \\f\\ L 2. 

Remark This lemma is a simple and explicit example of the relationship between Fredholm proper- 
ties of operators and exponential dichotomies, which has been very useful in the theory of dynamical 
systems [18]. In particular, the fact that we require a(x) to converge to its limiting values in L 1 is 
a very natural assumption in this context. 

Proof, (of Lemma) Define /u(x) = exp(— a(t)dt). Then the unique solution of (3.7) with u(xq) = 
uq is 

u{x)=u Q /^x)+ r^lf(y)dy. (3.8) 

Write u = u > + u < , where u > (x) = u{x) for x > xq, and zero otherwise, and u < {x) = u{x) for 
x < xq, and zero otherwise. Then ||tt||^2 — II II £,2 ~i~ \ I u^\\ 2 ^2- ^Ve will bound ||"U>||^2 and leave the 
estimate on ||n < ||^ 2 as an exercise. 

We now consider specifically the situation in Case 1, where a_ > > a + . Define h > (x) = \h(x)\ 
if x > xq and h > (x) = for x < xq. Likewise define £ > (x) = exp(a + x) if x > and zero otherwise. 
Then we can estimate 

f X 



|u > (x)| = | / e^y a ^ dt h(y)dy\ 

J Xo 



'Xo 

f X e a + (x-y)+f*(a(t)-a + )dt h ^ d ^ 
J xo 



< 



pX pOO 

C+ / e a +^\h(y)\dy = C+ / £ + {x - y)h>(y)dy 

J Xq J —OO 

From this last expression we immediately obtain ||ti > ||z / 2 < C+Hf+H^i ||/i > ||l2, from Young's in- 
equality. The L 2 norm of the derivative of u > can be estimated in a similar fashion which completes 
the estimate of the H 1 norm, and the proof of Case 1. 
Now turn to Case 2. Rewrite (3.8) as 

px 

H(x)u(x) = u + n{y)f{y)dy (3.9) 

Jxo 

The assumptions on a imply that fi(x) — > as \x\ — > oo and thus, in order for the solution u(x) to 
be bounded we must have 

poo p — oo 

+ / Kv)f{y)dy = u + ti(y)f{y)dy = , (3.10) 

Jxo Jxo 

which uniquely defines uq provided 

poo p—oo poo 

0=/ ii{y)f{y)dy- ^{y)f{y)dy = n{y)f{y)dy (3.11) 

Jxo Jxo J —oo 

Note that fi(x) is the solution of the adjoint ODE, and hence the hypothesis of Case 2 is satisfied. 
The resulting bound on the norm of u then follows as in Case 1. □ 

We now apply the Lemma to (3.6) where we see that 

a ,u + u\ 1 u-v! 
«W=2 COS (— ) + -»(—) (3.12) 
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Recalling that u' G H 1 and that u is an H l perturbation of the kink, we see that u ± v! will 
approach their limits as x — > ±00 in I? and hence that cos( 2 ^ L ) will approach their limits in L 1 
as required. Furthermore, a± = =f(| + ^-), so we are in Case 1 of the Lemma. Thus, (3.6) can be 
solved for any / G L 2 , which gives the invertibility of D U>V F and verifies (HI). 

Next consider hypothesis (CI) of Theorem 2.1. Note that from the calculation above, we see 
that D( U V }F does have a one-dimensional kernel spanned by 

4>{x) = n(x) and ip(x) = ^(-^) cos( "* M ) - | cos( ^ - - )j n{x) - n'{x). 

Consider the subspace orthogonal to this kernel. Note that we need a uniform bound on the inverse 
of D( U ^F only for (u',v') 6 H = {(0,0)} and for a in some compact subset of (0,1). A bound 
on the inverse is easily derived from the proof of the lemma and is proportional to the constant 
C a which we again see from the proof of the lemma is determined by \ a(t) — a±\dt. Since 
v! = 0, we know that u(x) is given by a 1-kink solution of the Sine-Gordon equation and hence 
a(x) = I cos(|) + cos(tj). If we choose the point xo to be the midpoint of the kink, then it is 
easy to show that the quantities J^°° \®{t) — a±\dt, and hence the constants C a , can be bounded 
uniformly for all kinks with parameter a in some compact subinterval of (0, 1). To verify hypothesis 
(CI) it remains only to check that the derivative Dr u ^ v \F is Lipschitz, but this follows from the fact 
that cos regarded as a function R — >■ R is Lipschitz together with the fact that the operator norm 
of a multiplication operator is bounded by the L°° norm of the the function by which it mutiplies. 
We now turn to (H2) and (C2). In this case, we must solve the equations 

D {u ,y ya) F(u,v,u',v',a) ^ I j = ( ^ , (3.13) 

where 

D(u>y, a )F(u,v,u',v',a) = 



_| C o S (H±«:)+ 1 cos(^) 1 -sin(^) + a- 2 



sin(& 



^ + ^cos(^) + fcos(^) a- 2 sin(^)+sin(^) 

(3.14) 

We can solve the first equation to find ip in terms of (f>, f, g and a. Thus, we focus on the second 
equation 

^ , ( a ,U + u\ 1 ,U — u',\ , ,, 

d x <j) + f - cos(^— ) + — cos(^) (f> = g- b(x)5a (3.15) 

where b(x) = a' 2 sin(^) + sin(^_) This is remarkably similar to equation (3.6) except that the 
sign in front of the non-autonomous term ^|cos( !L y L ) + ^ cos^^-)^ has changed. This means 

that we are in Case 2 of the Lemma, rather than Case 1, and in order to solve the equation we 
must check that the right hand side of (3.15) is orthogonal to the solution of the adjoint ODE. In 
this case, one can check that the solution of the adjoint ODE is fi(x) = exp(JJ Q a(t)dt), where in 

this case a(x) = ^| cos( H ±^) + ^ cos( y -^-)^ . We can insure that the RHS of (3.15) is orthogonal 

to fi by picking 5a appropriately, provided b(x)/j,(x)dx 7^ 0. 

So far, we have not found any way of demonstrating that this integral is non-zero for an arbitrary 
choice of u G X\ and v! G Xo- However, if we take u' = and u equal to a 1-kink, we have 
b(x) = 2(l + a~ 2 )exp(ax)/(l + exp(2ax)) > 0. Likewise, /i(x) > for all x, so J^° oo b(x)iJ,(x)dx / 0. 
Since both b and fi depend smoothly on u and v! , this condition will also hold for all u near the 
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1-kink and all v! near the zero-solution. Thus, Theorem 3.1, holds on such a neighborhood. This 
verifies hypothesis (H2). To check (CI) we need only derive a uniform estimate the solution <fi of 
(3.15) for u' = and a in some compact subinterval of (0, 1). This follows in a very similar fashion 
to estimates on solutions of (3.8), and we leave the details as an easy exercise. 

□ 



Remark We note that there is a natural path to attempt to build on the preceding result to 
establish the stability of an arbitrary fc-kink solution. It is known that the Backlund transformation 
(3.1) links the A;-kink to the k + 1-kink. Thus, we can repeat the above proof, this time considering 

F:I 2 xIix(0,l)^2 (3.16) 

and considering the the base point of our theorem (u',v') to be a 1-kink. Then the manifold H is 
the family of 1-kinks. If we then consider the linearizations D^ U ^F and Dr u > v i a \F , the verification 
of (HI), (H2), (CI) and (C2) proceed much as above. The only points that need to be checked are 
the uniform estimates on the inverses. These require uniform estimates on the analogues of (3.6) 
and (3.15). In our estimates of the stability of the 1-kink, we had the freedom to choose the point 
xo to be the center of the kink. This made it simple to establish uniform estimates. For multi-kink 
solutions, there is no such distinguished point and we need an analysis of the form of the 2-kink 
solution to show that for large time, the solution of these equations can be treated essentially by 
regarding the 2-kink as a sum of two 1-kink solutions which were estimated above. This program 
is carried out in detail to establish the stability of the multi-soliton solutions of the Toda lattice in 
[19]. Once the stability of the 2-kink solution is established, it can be used in conjunction with the 
Backlund transformation to establish the stability of the 3-kink, and so-on and so-forth. 



3.2. The Toda Lattice. As a second application of Theorem 2.1 we study the orbital stability of 
the multi-soliton solutions in the Toda Lattice 

qj = Pj - Pj = e <U-i-<U _ e qj-n+i (3.17) 

posed in the energy space £ 2 x £ 2 . Here q and p can be regarded as the position and momentum, 
respectively, of the jth particle in an infinite chain where neighboring particles resist compression 
quite strongly but resist extension only weakly. 

Because of the lattice discreteness, some of the quantities that are most easily used to obtain 
multi-soliton solutions as constrained minimizers of a Lyapunov function in the PDE case are no 
longer conserved and so the Lyapunov function approach is not easily extended. For single solitons 
more detailed stability results have been established, specifically orbital stability with asymptotic 
phase, and moreover asymptotic stability in a weighted space [20]. The techniques used in [20] were 
a combination of the Backlund approach we take here with the dispersive approach mentioned in 
the introduction . Using a linearized version of the Backlund transform, asymptotic stability of 
multi-solitons has also been obtained, albeit in an exponentially weighted space [19]. 

The Backlund transform for the Toda lattice, written in our framework, is 

w( , , s ( Pj +e-y-*-^+e-^-^-2cosh K \ 
F^,P, q ,P,n)= ^. + e -(^-i.) + e -to +1 -^)_ 2ooBhK ) < 3 - 18 ) 

where k is a real parameter. As with the sine-Gordon equation considered above, the zero and 
one-kink solutions (and more generally the m- and m + 1-kink solutions) are related via (3.18) with 
the parameter n controlling the amplitude (2k) and speed sm ^ K of the additional kink. 
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Let = ^ be the propagator for the Toda lattice. Denote the m-kink solution with amplitude pa- 
rameter K m and phase parameters S±, ■ ■ ■ 8 m by (q K ~L<— K ™< s i<"' s ™ ^ «m,<5i,-<5m) Q r more con- 
cisely {q m ,p m ) when the parameters are understood. Given an m-kink solution (q m ,p m ) and an m+ 
1-kink solution (q rn+1 ,p rn+1 ) related via Backlund transform with parameter k: F(q m+1 ,p m+1 ,q m ,p m , 
0, define the affine spaces X m and X m+1 to be the space of £ 2 perturbations of (q m ,p m ) and 
(q rn+1 ,p m+1 ) respectively. 

Theorem 3.3. The Toda-m-soliton is orbitally stable in £ 2 in the sense of Lyapunov: Let M denote 
the 2m -dimensional manifold of m-soliton states with phases free to vary in R and with amplitudes 
constrained to any compact set. Let (Q m ,P m ) denote a point on M. For each e > there is a 
5 > such that whenever (q,p) G £ 2 x £ 2 satisfies \\(Q m ,P m ) — {q,p)\\p x P < 5, then its forward 
evolution &(t)(q,p) under (3.17) satisfies d(&(t)(q,p), M) < e. 

Proof. The proof proceeds by induction. At each stage of the induction we apply Theorem 2.1. At 
the k th stage of the induction, the invariant manifold H is a particular 1-manifold corresponding to 
the temporal evolution of a particular k — 1-soliton while M is a particular 2-manifold corresponding 
to the temporal evolution of a one-parameter family of fc-solitons with the parameter corresponding 
to the initial phase of the additional soliton. The inductive hypothesis is used not to verify the 
hypotheses (H0)-(H2) and (C0)-(C2) which are established by hand at each stage of the induction, 
but rather to verify that H is Lyapunov-stable. This is a natural use of the inductive hypothesis 
because the H used for the k th stage of induction is a submanifold of the M used for the (k — l) st 
stage of the induction and hence Lyapunov-stability is guaranteed by Theorem 2.1. 

We first verify (HO). In the base case we set (q',p') = (0, 0) and solve the equation F(q,p, 0, 0, k) = 

U tor q 3 - log cosh(K(i+1)+7) and p 3 - e ^ cosh(Kj+7+K) - i-J +e y CO sh( K j+ 7 ) L ) ■ iViore geni- 
ally, we set (q',p') = (q m ,p m ) and solve F(q,p,q m ,p m ,K) = for (q,p) = (q m+1 ,p m+1 ). To do this 
computation from scratch is a significant undertaking; we rely on the early literature in the history 
of the Toda lattice [21]. 

Note that in this formulation the differences in the asymptotic values satisfy q^^ — q~oo = and 
q'oo ~ Qoo = —2k. We now check the hypotheses (H1)-(H2). 
To check (HI) we differentiate F and obtain 



D(p yq )F(p,q,p',q',K) = 



e -(9'-9-«) _ g-(9-?_ +«) J 
e -(q'-q-K) _ e -{q+-q'+ft) g q 



Here S is the shift operator (Sq)j = qj + \ and the symbols q± denote the shifted sequences S' ±1 g. 
We must study 

e -(?'-?-«) _ e -(q-q'-+it) t \ 

)(*) = (f 

e -{q'-q-K) _ e -( q+ - q ' +K ) S Q J \V J \9 

obtaining solvability conditions for <fr and ip in terms of / and g. 

Note that (j) is given as a linear combination of / and the action of a multiplication operator 
(bounded from £ 2 — > £ 2 ) on ip. Thus we restrict attention to the second row, which is a first order 
linear difference equation 

= e -(V-9-?+-2K)^ + e (q + ~q'+n) g 

This equation is of the form ipj+i = ctjipj + fj which can be solved explicitly via a summing factor. 
A discrete analog of Lemma 3.2 holds with the relevant numbers now |a±oo| rather than sgn(a±oo): 
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Lemma 3.4. Consider the first order linear recursion u n+ \ — a n u n = f n and suppose that the 
limits a± = lim ra _ i ,± 00 a n exist. 

(1) Suppose that \ oo I ^ 1 ^ I ctoo | and that ^2n= 1 1 ^ n — 1 ^ ^ ^ well as X^n=— oo I < ^ n — I ^ 
oo. T/ien i/iere is a constant C a depending on J2n=o \ a n~ a ±\ such that the unique solution 
u with uo = satisfies \\u\\p < C a \\f\\£2. 

(2) Suppose that |a_oo < 1 < \aoo\ and that Yl^=i \ a n~ «+| < oo as well as Y2n=-oo l a « — a -l < 
oo. T/ien i/jere is a constant C a depending on Yln=o \ an ~ a ±l suc ^ */ / satisfies 
]CneZ /n^n = /or </> i/je unique (up to scalar multiple) solution of the adjoint equation 
4> n -i — ct n (j) n = then there is a unique choice of uq such that u G I 2 and for this choice 
\\u\\p < C a \\f\\ e 2 

Its proof is similar to the proof of Lemma 3.2 and can be regarded as a consequence of the theory 
of exponential dichotomies or as an exercise for the reader. 

We continue now with the proof of Theorem 3.3, computing a± = e~ 2 ^ q± °°~ q±a °~ K ^ = e =F2K . To 
check the hypotheses of the lemma we must show that a — a± is summable over ±N. To that 
end, we compute a - a + = e~ 2K (V(2<?'-<?'s+-4k) _ w q i _ ^ q + 2k ) + q > - ( q+ + 2k). It follows 

from Lemma 2.1 in [19] together with the fact that q + — q is exponentially localized [22], that this 
quantity is in I 1 and moreover approaches 2k in I 1 exponentially fast as t — > oo when q and q' are 
m and m — 1 soliton solutions respectively. We remark that this is one place where the restriction 
to a compact set of k is necessary. These estimates are not uniform in the limit as any of the wave 
speeds goes to zero. 

The coefficients a± satisfy ct_ > 1 > q+ and thus we are in case (1) of the lemma. This 
establishes (HI). In the base case this establishes the uniform bound in (CI) as well; after all 
uniform bounded are not hard to obtain on H x M = {(0, 0)} x M/Z. The constant C a given in the 
lemma depends only upon ^2 n>Q a n — a + which has a limit in i l as t — > oo and hence is bounded 
uniformly as t — > oo. This establishes that the constant C a is bounded along any trajectory in 
H x M n F _1 (0) that corresponds to the temporal evolution of the A;-soliton state under (3.17), 
i.e. it establishes the uniform bound in (CI) when the manifold H in the theorem is the orbit of a 
fc-soliton state. To verify (CI) it only remains to check that the derivative of F is Lipschitz, but 
this is immediate. 

We now check (H2)-(C2). We compute 

/ e -(Q-q'-+K) S -l _ e -{q'-q-K) q e -{q' -q-n) _ e ~{q-q!_+n) _ 2 ginh K 

D q >, p >, K F(q,p,q',p',K) = 

y e -{q+-q+K) _ e -(q'-q-K) j e -(q'-q~K) _ e -(q+-q'+n) _ 2 sinh K 

and solve 

e -(,-^ +/s ) 5 -l _ e -(q'-q-K) o e -{q'-q-K) _ e -{q-qL+K) _ 2 S mh K 



e -(?+-3+«) _ e -(?'-g-«) / e -(?'-g-«) _ e -(<?+-9 / +K) _ 2 sinh k 




The second row gives ip as a linear combination of g, 5k and a bounded multiplication operator 
acting on <p thus we restrict attention to the first row which is a first order, non- autonomous linear 
recurrence for (f>. The coefficient well behaved just like the similar coefficient 

we studied when verifying (Hl)-(Cl). In particular, the hypotheses of Lemma 2 are satisfied and 
we are in case 2. To verify (H2) we must show that the (2, 3) entry of the derivative matrix is not 
orthogonal to the kernel of the adjoint. At first glance it appears that one must dirty one's hands 
with the explicit form of the m-soliton solution in order to do this computation. However, it was 



12 



Hoffman and Wayne 



shown in [19] that the quantity of interest is independent of time and hence it suffices to analyze 
the quantity in the limit t — > oo where it reduces to the computation for the vaccuum-kink pairing. 
This computation is not difficult and has been checked in [19]. To check (C2) we again make use of 
the fact that a multi-soliton decomposes into the linear superposition of soliton solutions in I 1 . □ 
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